Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Abstract Background Applying directed acyclic graph (DAG) models to proteogenomic data has been shown effective for detecting causal biomarkers of complex diseases. However, there remain unsolved challenges in DAG learning to jointly model binary clinical outcome variables and continuous biomarker measurements. Results In this paper, we propose a new tool, DAGBagM, to learn DAGs with both continuous and binary nodes. By using appropriate models, DAGBagM allows for either continuous or binary nodes to be parent or child nodes. It employs a bootstrap aggregating strategy to reduce false positives in edge inference. At the same time, the aggregation procedure provides a flexible framework to robustly incorporate prior information on edges. Conclusions Through extensive simulation experiments, we demonstrate that DAGBagM has superior performance compared to alternative strategies for modeling mixed types of nodes. In addition, DAGBagM is computationally more efficient than two competing methods. When applying DAGBagM to proteogenomic datasets from ovarian cancer studies, we identify potential protein biomarkers for platinum refractory/resistant response in ovarian cancer. DAGBagM is made available as a github repository at https://github.com/jie108/dagbagM .more » « less
-
Abstract Mosses (Bryophyta) are a key group occupying an important phylogenetic position in land plant (embryophyte) evolution. The class Bryopsida represents the most diversified lineage, containing more than 95% of modern mosses, whereas other classes are species‐poor. Two branches with large numbers of gene duplications were elucidated by phylogenomic analyses, one in the ancestry of all mosses and another before the separation of the Bryopsida, Polytrichopsida, and Tetraphidopsida. The analysis of the phylogenetic progression of duplicated paralogs retained on genomic syntenic regions in thePhyscomitrella patensgenome confirmed that the whole‐genome duplication events WGD1 and WGD2 were re‐recognized as the ψ event and the Funarioideae duplication event, respectively. The ψ polyploidy event was tightly associated with the early diversification of Bryopsida, in the ancestor of Bryidae, Dicranidae, Timmiidae, and Funariidae. Together, four branches with large numbers of gene duplications were unveiled in the evolutionary past ofP. patens. Gene retention patterns following the four large‐scale duplications in different moss lineages were analyzed and discussed. Recurrent significant retention of stress‐related genes may have contributed to their adaption to distinct ecological environments and the evolutionary success of this early‐diverging land plant lineage.more » « less
An official website of the United States government
